Cues for hesitation in speech synthesis
نویسندگان
چکیده
The current study investigates acoustic correlates to perceived hesitation based on previous work showing that pause duration and final lengthening both contribute to the perception of hesitation. It is the total duration increase that is the valid cue rather than the contribution by either factor. The present experiment using speech synthesis was designed to evaluate F0 slope and presence vs. absence of creaky voice before the inserted hesitation in addition to durational cues. The manipulations occurred in two syntactic positions, within a phrase and between two phrases, respectively. The results showed that in addition to durational increase, variation of both F0 slope and creaky voice had perceptual effects, although to a much lesser degree. The results have a bearing on efforts to model spontaneous speech including disfluencies, to be explored, for example, in spoken dialogue systems.
منابع مشابه
Interactive Hesitation Synthesis: Modelling and Evaluation
Conversational spoken dialogue systems that interact with the user rather than merely reading the text can be equipped with hesitations to manage dialogue flow and user attention. Based on a series of empirical studies, we elaborated a hesitation synthesis strategy for dialogue systems, which inserts hesitations of a scalable extent wherever needed in the ongoing utterance. Previously, evaluati...
متن کاملModelling Hesitation for Synthesis of Spontaneous Speech
The current work deals with the modelling of one type of disfluency, hesitations. A perceptual experiment using speech synthesis was designed to evaluate two duration features found to be correlates to hesitation, pause duration and final lengthening. A variation of F0 slope before the hesitation was also included. The most important finding is that it is the total duration increase that is the...
متن کاملPeriodic cycles of hesitation phenomena in spontaneous speech
To verify whether hesitation phenomena are distributed periodically in spontaneous speech, twenty speech samples produced by five male adults were analyzed. Spectral analysis allowed for three main findings. First, hesitations present stationary behavior, which implies they did not accumulate in the beginning, in the middle, or in the end of speech samples. Second, periodic cycles of hesitation...
متن کاملInteractive Hesitation Synthesis and Its Evaluation
Conversational spoken dialogue systems that interact with the user rather than merely 1 reading text can be equipped with hesitations to manage the dialogue flow and the users’ attention. 2 Based on a series of empirical studies, we built an elaborated hesitation synthesis strategy for 3 dialogue systems that inserts hesitations of scalable extent wherever needed in the ongoing 4 utterance. So ...
متن کاملContextual Probability and Word Frequency as Determinants of Pauses and Errors in Spontaneous Speech
This study investigated the relationship between the contextual probability of lexical items in spontaneous speech, as measured by the Cloze procedure, and word frequency. It also attempted to determine the relative importance of the two variables in causing delay, in the form of hesitation, in the production of spontaneous speech. The analysis revealed that content words of low contextual prob...
متن کامل